Active inference, evidence accumulation, and the urn task

Author: Karl Friston
Michael Moutoussis
Philipp Schwartenbeck
Raymond J. Dolan
Thomas H. B. FitzGerald
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2015

Deciding how much evidence to accumulate before making a decision is a problem we and other animals often face, but one that is not completely understood. This issue is particularly important because a tendency to sample less information (often known as reflection impulsivity) is a feature in several psychopathologies, such as psychosis. A formal understanding of information sampling may therefore clarify the computational anatomy of psychopathology. In this theoretical letter, we consider evidence accumulation in terms of active (Bayesian) inference using a generic model of Markov decision processes. Here, agents are equipped with beliefs about their own behavior--in this case, that they will make informed decisions. Normative decision making is then modeled using variational Bayes to minimize surprise about choice outcomes. Under this scheme, different facets of belief updating map naturally onto the functional anatomy of the brain (at least at a heuristic level). Of particular interest is the key role played by the expected precision of beliefs about control, which we have previously suggested may be encoded by dopaminergic neurons in the midbrain. We show that manipulating expected precision strongly affects how much information an agent characteristically samples, and thus provides a possible link between impulsivity and dopaminergic dysfunction. Our study therefore represents a step toward understanding evidence accumulation in terms of neurobiologically plausible Bayesian inference and may cast light on why this process is disordered in psychopathology
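As an illustration of the kind of evidence accumulation the abstract describes, here is a minimal sketch of Bayesian belief updating in a two-urn bead task. It is not the paper's Markov decision process or variational scheme. The function names, the 85/15 bead ratio, and the fixed confidence threshold are assumptions made for the example; the threshold is only a crude stand-in for whatever governs how much an agent samples.

```python
import math

def posterior_urn_a(draws, p_major=0.85, prior_a=0.5):
    """Posterior probability that the beads come from urn A.

    draws: sequence of 'a' / 'b' bead colours.
    Assumption: urn A holds a fraction p_major of 'a' beads,
    urn B holds the same fraction of 'b' beads.
    """
    log_odds = math.log(prior_a / (1.0 - prior_a))
    # Log-likelihood ratio contributed by a single bead.
    llr = math.log(p_major / (1.0 - p_major))
    for bead in draws:
        log_odds += llr if bead == "a" else -llr
    return 1.0 / (1.0 + math.exp(-log_odds))

def draws_to_decision(draws, threshold=0.9, p_major=0.85):
    """Number of beads seen before confidence in either urn
    reaches the (hypothetical) threshold; None if it never does."""
    for n in range(1, len(draws) + 1):
        p = posterior_urn_a(draws[:n], p_major=p_major)
        if max(p, 1.0 - p) >= threshold:
            return n
    return None
```

Under these assumptions, lowering `threshold` makes the simulated agent commit after fewer beads, which is the behavioural signature usually described as reflection impulsivity.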

The Dopaminergic Midbrain Encodes the Expected Certainty about Desired Outcomes

Author: Dolan Ray
FitzGerald Thomas H B
Friston Karl
Mathys Christoph
Schwartenbeck Philipp
Publication venue: 'Oxford University Press (OUP)'
Publication date: 23/07/2014

Dopamine plays a key role in learning; however, its exact function in decision making and choice remains unclear. Recently, we proposed a generic model based on active (Bayesian) inference wherein dopamine encodes the precision of beliefs about optimal policies. Put simply, dopamine discharges reflect the confidence that a chosen policy will lead to desired outcomes. We designed a novel task to test this hypothesis, where subjects played a "limited offer" game in a functional magnetic resonance imaging experiment. Subjects had to decide how long to wait for a high offer before accepting a low offer, with the risk of losing everything if they waited too long. Bayesian model comparison showed that behavior strongly supported active inference, based on surprise minimization, over classical utility maximization schemes. Furthermore, midbrain activity, encompassing dopamine projection neurons, was accurately predicted by trial-by-trial variations in model-based estimates of precision. Our findings demonstrate that human subjects infer both optimal policies and the precision of those inferences, and thus support the notion that humans perform hierarchical probabilistic Bayesian inference. In other words, subjects have to infer both what they should do as well as how confident they are in their choices, where confidence may be encoded by dopaminergic firing

Sissa Digital Library

Paris Lodron University of Salzburg

Older adults fail to form stable task representations during model-based reversal inference

Author: Dolan Raymond Joseph
Duzel Emrah
Fitzgerald Thomas Henry Benedict
Gallagher Maria
Hämmerer Dorothea
Schwartenbeck Philipp
Publication venue: 'Elsevier BV'
Publication date: 13/10/2018

Older adults struggle in dealing with changeable and uncertain environments across several cognitive domains. This has been attributed to difficulties in forming adequate task representations that help navigate uncertain environments. Here, we investigate how, in older adults, inadequate task representations impact on model-based reversal learning. We combined computational modeling and pupillometry during a novel model-based reversal learning task, which allowed us to isolate the relevance of task representations at feedback evaluation. We find that older adults overestimate the changeability of task states and consequently are less able to converge on unequivocal task representations through learning. Pupillometric measures and behavioral data show that these unreliable task representations in older adults manifest as a reduced ability to focus on feedback that is relevant for updating task representations, and as a reduced metacognitive awareness of the accuracy of their actions. Instead, the data suggest that older adults’ choice behavior was guided more by uninformative feedback properties such as outcome valence. Our study highlights that an inability to form adequate task representations may be a crucial factor underlying older adults’ impaired model-based inference.

Generative replay underlies compositional inference in the hippocampal-prefrontal circuit

Author: Baram Alon
Behrens Timothy
Botvinick Matthew
Dolan Raymond
Kurth-Nelson Zeb
Liu Yunzhe
Mark Shirley
Muller Timothy
Schwartenbeck Philipp
Publication venue
Publication date: 06/10/2023

Human reasoning depends on reusing pieces of information by putting them together in new ways. However, very little is known about how compositional computation is implemented in the brain. Here, we ask participants to solve a series of problems that each require constructing a whole from a set of elements. With fMRI, we find that representations of novel constructed objects in the frontal cortex and hippocampus are relational and compositional. With MEG, we find that replay assembles elements into compounds, with each replay sequence constituting a hypothesis about a possible configuration of elements. The content of sequences evolves as participants solve each puzzle, progressing from predictable to uncertain elements and gradually converging on the correct configuration. Together, these results suggest a computational bridge between apparently distinct functions of hippocampal-prefrontal circuitry and a role for generative replay in compositional inference and hypothesis testing

Dopaminergic basis for signalling belief updates, but not surprise, and the link to paranoia

Author: Adams Rick A.
Coello Christopher
Dahoun Tarik
Dolan Raymond J
Fitzgerald Thomas H.B.
Howes Oliver D.
Nour Matthew M.
Schwartenbeck Philipp
Wall Matthew B.
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2018

Distinguishing between meaningful and meaningless sensory information is fundamental to forming accurate representations of the world. Dopamine is thought to play a central role in processing the meaningful information content of observations, which motivates an agent to update their beliefs about the environment. However, direct evidence for dopamine’s role in human belief updating is lacking. We addressed this question in healthy volunteers who performed a model-based functional magnetic resonance imaging (fMRI) task designed to separate the neural processing of meaningful and meaningless sensory information. We modelled participant behaviour using a normative Bayesian observer model, and used the magnitude of the model-derived belief update following an observation to quantify its meaningful information content. We also acquired positron emission tomography (PET) imaging measures of dopamine function in the same subjects. We show that the magnitude of belief updates about task structure (meaningful information), but not pure sensory surprise (meaningless information), is encoded in midbrain and ventral striatum activity. Using PET we show that the neural encoding of meaningful information is negatively related to dopamine-2/3 receptor availability in the midbrain and dexamphetamine-induced dopamine release capacity in the striatum. Trial-by-trial analysis of task performance indicated that subclinical paranoid ideation is negatively related to behavioural sensitivity to observations carrying meaningful information about the task structure. The findings provide direct evidence implicating dopamine in model-based belief updating in humans, and have implications for understanding the pathophysiology of psychotic disorders where dopamine function is disrupted.

King's Research Portal

Elsevier - Publisher Connector

Active inference and learning

Author: Francesco Rigoli
Giovanni Pezzulo
John O'Doherty
Karl Friston
Philipp Schwartenbeck
Thomas FitzGerald
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016

This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity
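The link between risk, ambiguity and the classical limit can be written out compactly. The block below is a sketch in the notation commonly used in the active inference literature, not a quotation from this abstract: the symbols G (expected free energy of a policy π at a future time τ), Q (the agent's predictive beliefs), P(o_τ) (prior preferences over outcomes) and H (entropy) are assumptions of the example.

```latex
% Expected free energy of policy \pi at future time \tau,
% split into a risk term and an ambiguity term (assumed notation).
\begin{aligned}
G(\pi,\tau)
  &= \underbrace{D_{\mathrm{KL}}\!\left[\,Q(o_\tau \mid \pi)\,\middle\|\,P(o_\tau)\,\right]}_{\text{risk}}
   + \underbrace{\mathbb{E}_{Q(s_\tau \mid \pi)}\!\left[\,H\!\left[P(o_\tau \mid s_\tau)\right]\right]}_{\text{ambiguity}}
\end{aligned}
```

When every hidden state maps to a single outcome, the entropy of P(o_τ | s_τ) is zero and only the risk term is left, so policies are scored purely by how closely their predicted outcomes match the preferred ones. This is one way to read the abstract's statement that the scheme reduces to a classical one in the absence of ambiguity.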

Optimal inference with suboptimal models: Addiction and active Bayesian inference

Author: Christoph Mathys
Friedrich Wurst
Karl Friston
Martin Kronbichler
Philipp Schwartenbeck
Ray Dolan
Thomas H.B. FitzGerald
Publication venue: 'Elsevier BV'
Publication date: 15/12/2014

When casting behaviour as active (Bayesian) inference, optimal inference is defined with respect to an agent's beliefs - based on its generative model of the world. This contrasts with normative accounts of choice behaviour, in which optimal actions are considered in relation to the true structure of the environment - as opposed to the agent's beliefs about worldly states (or the task). This distinction shifts an understanding of suboptimal or pathological behaviour away from aberrant inference as such, to understanding the prior beliefs of a subject that cause them to behave less 'optimally' than our prior beliefs suggest they should behave. Put simply, suboptimal or pathological behaviour does not speak against understanding behaviour in terms of (Bayes optimal) inference, but rather calls for a more refined understanding of the subject's generative model upon which their (optimal) Bayesian inference is based. Here, we discuss this fundamental distinction and its implications for understanding optimality, bounded rationality and pathological (choice) behaviour. We illustrate our argument using addictive choice behaviour in a recently described 'limited offer' task. Our simulations of pathological choices and addictive behaviour also generate some clear hypotheses, which we hope to pursue in ongoing empirical work

Sissa Digital Library

Active Inference: A Process Theory

Author: Francesco Rigoli
Giovanni Pezzulo
Karl Friston
Philipp Schwartenbeck
Thomas FitzGerald
Publication venue: 'MIT Press - Journals'
Publication date: 21/11/2016

This article describes a process theory based on active inference and belief propagation. Starting from the premise that all neuronal processing (and action selection) can be explained by maximizing Bayesian model evidence—or minimizing variational free energy—we ask whether neuronal responses can be described as a gradient descent on variational free energy. Using a standard (Markov decision process) generative model, we derive the neuronal dynamics implicit in this description and reproduce a remarkable range of well-characterized neuronal phenomena. These include repetition suppression, mismatch negativity, violation responses, place-cell activity, phase precession, theta sequences, theta-gamma coupling, evidence accumulation, race-to-bound dynamics, and transfer of dopamine responses. Furthermore, the (approximately Bayes’ optimal) behavior prescribed by these dynamics has a degree of face validity, providing a formal explanation for reward seeking, context learning, and epistemic foraging. Technically, the fact that a gradient descent appears to be a valid description of neuronal activity means that variational free energy is a Lyapunov function for neuronal dynamics, which therefore conform to Hamilton’s principle of least action

City Research Online